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Abstract 

Computing maximum independent sets in graphs is an important problem in computer 
science. In this paper, we develop an evolutionary algorithm to tackle the problem. The core 
innovations of the algorithm are very natural combine operations based on graph partitioning 
and local search algorithms. More precisely, we employ a state-of-the-art graph partitioner 
to derive operations that enable us to quickly exchange whole blocks of given independent 
sets. To enhance newly computed offsprings we combine our operators with a local search 
algorithm. Our experimental evaluation indicates that we are able to outperform state-of-the- 
art algorithms on a variety of instances. 


1 Introduction 

In a simple and connected graph, an independent set is a subset of the nodes such that every pair 
of nodes that can be formed from the set is not adjacent. The maximum independent set problem 
is then to find the independent set in the graph with the largest possible cardinality. There are 
lots of applications that benefit from large independent sets such as information retrieval, signal 
transmission analysis, classification theory, economics, scheduling or computer vision [|9l. As a 
more specific example, finding large independent sets is useful in map labeling IfTTl where one 
wants to maximize the number of visible non-overlapping labels on a map. Here, a graph model is 
built such that labels correspond to nodes and there is an edge between two nodes if the associated 
labels are overlapping. It is easy to see that a maximum independent set in the model yields a 
maximum number of non-overlapping labels. 

The maximum independent set problem is closely related to the maximum clique problem 
and the minimum vertex cover problem. More precisely, the complement of an independent set 
X results in a vertex cover V\T and an independent set is a clique in the complement graph G. 
However, note that results from the maximum clique problem are usually only partially transferable 
to practical algorithms for the maximum independent set problem since building the complement 
of sparse graphs yields dense graphs. It is well known that all of these problems are NP-hard IfTOl . 
Thus, one relies on heuristic algorithms to find good solutions on large graphs. 

Most of the work in literature considers heuristics and local search algorithms for the maximum 
clique problem (see for example 0[[5l[l3> 16112011141 ). These algorithms keep a single solution 


and try to improve it by using node deletions, insertions, swaps as well as the concept of plateau 
search. In this context, plateau search only accepts moves that do not change the objective function 
of the optimization problem. Heuristics usually employ node swaps to achieve that. A node swap 
refers to the replacement of a node by one of its neighbors; Hence, a node swap cannot directly 
increase the size of the independent set but can yield a situation where an additional node may get 
inserted to the solution. A very successful approach for the maximum clique problem has been 
presented by Grosso et al. 03}. In addition to the plateau search approach, different diversification 
operations are performed and restart rules are added. In the independent set context, Andrade 
et al. 0} extended the notion of swaps to (j. A')-swaps. A (j, /c)-swap removes j nodes from the 
current solution and inserts k nodes. The authors present a fast linear-time implementation that, 
given a maximal solution, can find a (1, 2)-swap or prove that none exists. We implemented the 
algorithm and use it within our evolutionary algorithm to improve newly computed offsprings. 

There are very few papers considering evolutionary algorithms for the maximum independent 
set problem. The general idea behind evolutionary algorithms is to use mechanisms which are 
highly inspired by biological evolution such as selection, mutation, recombination and survival of 
the fittest. An evolutionary algorithm starts with a population of individuals (in our case indepen¬ 
dent sets of the graph) and evolves the population into different populations over several rounds. In 
each round, the evolutionary algorithm uses a selection rule based on the fitness of the individuals 
of the population to select good individuals and combine them to obtain improved offspring 02}. 

Back and Khuri [{3} and Borisovsky and Zavolovskaya [[6} use fairly similar approaches. They 
encode solutions as bitstrings such that the value at position i equals one if and only if node i is 
in the current solution. In both cases a classic two-point crossover is used which randomly selects 
two crossover points pi,p 2 - Then all bits in between these positions are exchanged between both 
input individuals. Note that this likely results in invalid solutions. To guide the search towards 
valid solutions a penalty approach is used. A major drawback of the work by Back and Khuri [[3} is 
that the authors only test their algorithm on synthetic instances. Moreover, in both cases the graphs 
under consideration are very small. 

The main contribution of our paper is a very natural evolutionary framework for the com¬ 
putation of large maximal independent sets. The core innovations of the algorithm are combine 
operations based on graph partitioning and local search algorithms. More precisely, we employ 
the state-of-the-art graph partitioner KaHIP [|2D to derive operations that enable us to quickly 
exchange whole blocks of given individuals. The newly computed offsprings are then improved 
using a local search algorithm. In contrast to previous evolutionary algorithms, each computed 
offspring is valid. Hence, we only allow valid solutions in our population and thus are able to use 
the cardinality of the independent set as a fitness function. The rest of paper is organized as fol¬ 
lows. We begin in Section[2]by introducing basic concepts and related work. We describe the core 
components of our evolutionary algorithm in Section [3j This includes a number of partitioning 
based combine operators that take two individuals as input as well as combine operators that can 
take multiple individuals as input. A summary of extensive experiments done to tune the algorithm 
and evaluate its performance is presented in Section |4j Experiments indicate that our algorithm 
computes very good independent sets and outperforms state-of-the-art algorithms on large variety 
of instances. Finally, we conclude with Section [5] 
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2 Prelimiaries 


2.1 Basic Concepts 

Let G — (V = {0,. .., n — 1}, E) be an undirected graph with n = \V\ and m = \E\. The set 
N(v) := { u : {w, ?/} G E} denotes the neighbors of v. The complement of a graph is defined as 
G = (V, E) with E being the complement of E. An independent set is a subset X C V, such 
that there are no adjacent nodes in X. It is maximal, if it is not a subset of any larger independent 
set. The independent set problem is that of finding the maximum cardinality set among all possible 
independent sets. A vertex cover is a subset of nodes C C V, such that every edge e G E is at least 
incident to one node within the set. The minimum vertex cover problem asks for the vertex cover 
with the minimum number of nodes. It is worth mentioning that the complement of a vertex cover 

V \ C always is an independent set by definition. A clique is a subset of the nodes Q C V such 
that there is an edge between all pairs of nodes from Q. 

A fc-way partition of a graph is a division of V into blocks of nodes Vi,... ,V k , i.e. Vj U • • • U14 = 

V and Vi (T Vj = 0 for i 4 j- A balancing constraint demands that V?’ G {1../;:} : V < 
Xmax : = (1 + G r^l f° r some imbalance parameter e. The objective is to minimize the total cut 
Yhicj w (Eij ) where E^ := {{«, v} G E : u G V, v G Vj}. The set of cut edges is also called edge 
separator. The k-node separator problem asks to find k + 1 blocks, Vi, V 2 ,..., V k and a separator 
S, that partition V, such that there are no edges between the blocks. Again, a balancing constraint 
demands \V t \ < (1 + e) [|V|//c]. However, there is no balancing constraint on the separator S. The 
objective is to minimize the size of the separator \S\. Note that removing the set S from the graph 
results in at least k connected components and that the blocks V t itself do not need to be connected 
components. By default, our initial inputs will have unit edge and node weights. 

2.2 Detailed Related Work 

We now discuss algorithmical details of the algorithm by Andrade et al. [HI. We call the algorithm 
ARW as an abbreviation for Andrade, Resende and Werneck. While we compare our algorithm 
against ARW, we also use it within our algorithm to improve newly created offsprings. Moreover, 
we shortly present the KaHIP graph partitioning framework since we use it to compute partitions 
and node separators. 

ARW. One iteration of the ARW algorithm consists of a perturbation and a local search step. The 
ARW local search algorithm uses simple 2-improvements or (1, 2)-swaps to gradually improve a 
single current solution. A (j, /;:)-swap removes j nodes from the solution and then inserts k new 
nodes into it. A (l,2)-swap in particular removes a single node from the solution and adds two 
other free nodes. A node is called free, if none of its neighbouring nodes can be found in the current 
solution. The tightness of a node r(v) is the number of neighbouring solution nodes. Hence, free 
nodes have zero tightness. The simple version of the local search algorithm then iterates over all 
nodes of the graph and looks for a (1,2)-swap. It is shown, that this procedure can find a valid 
(1, 2)-swap in linear time O(rri), if it exists. This is achieved by using a data structure that allows 
insertion and removal operations on nodes in time proportional to their degree. The data structure 
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basically divides the nodes into solution nodes, free nodes and non-free non-solution nodes. The 
perturbation step used for diversification, forces nodes into the solution and removes neighboring 
nodes as necessary. In most cases, one node is forced into the solution per iteration. With a small 
probability the number of forced nodes / is set to higher value: / is set to i + 1 with probability 
1/2*. Moreover, the current node to be forced into a solution is picked from a number of random 
candidates. Among those candidates the vertex that has been outside the solution for the longest 
time is picked. We refer the reader to original paper for more details about the ARW algorithm. 
There is also an even faster incremental version of the algorithm that maintains a list of candidates. 
We use this version of the algorithm here. 

KaHIP. Karlsruhe High Quality Partitioning - is a family of graph partitioning programs that 
tackle the balanced graph partitioning problem f!2Tl [22]. The algorithms in KaHIP have been able 
to compute the best results in various benchmarks. It implements different sequential and parallel 
algorithms to compute k -way partitions and node separators. In this work, we use the sequential 
multi-level graph partitioner KaFFPa (Karlsruhe Fast Flow Partitioner) to obtain partitions and 
separators for the graphs. In particular, we use specialized partitioning techniques based on multi¬ 
level size-constrained label propagation [|T8ll . 


3 Evolutionary Components 

We now discuss the main contributions of the paper. We begin by outlining the general structure 
of our evolutionary algorithm and then explain how we build the initial population. Finally, we 
present our new combine operations and the methods we use for mutation. 

3.1 General Structure 

As previous work 01 [6J we use bitstrings as a natural way to represent individuals/solutions in our 
population. More precisely, an independent set X is represented as an array s = {0, l} n where 
s[i>] = 1 if and only if v £ 1. The general structure of our evolutionary algorithm is very simple. 
Our algorithm starts with the creation of a population of individuals (in our case independent sets 
in the graph) and evolves the population into different populations over several rounds until a 
stopping criterion is reached. 

In each round, our evolutionary algorithm uses a selection rule that is based on the fitness 
of the individuals (in our case the size of the independent set) of the population to select good 
individuals and combine them to obtain improved offspring. In contrast to previous work 00, 
our combine and mutation operators always create valid independent sets. Hence, we use the size 
of the independent set as a fitness function. That means that there is no need to use a penalty 
function to ensure that the final individuals generated by our algorithm are independent sets. As 
we will see later when an offspring is generated it is possible that it is a non-maximal independent 
set. Hence, we apply one iteration of ARW local search without the perturbation step to ensure that 
it is locally maximal and apply a mutation operation to the offspring. We use mutation operations 
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Algorithm 1 Steady State Evolutionary Algorithm with Local Search 
create initial population P 
while stopping criterion not fulfilled 
select parents X \, X 2 from P 
combine X x with X 2 to create offspring 0 
ARW local search+mutation on offspring O 
evict individual in population using O 
return the fittest individual that occurred 


since it is of major importance to keep the diversity in the population high fl2), i.e. the individuals 
should not become too similar, in order to avoid a premature convergence of the algorithm. 

We then use an eviction rule to select a member of the population and replace it with the new 
offspring. In general one has to take both into consideration, the fitness of an individual and the 
distance between individuals in the population [O. Our algorithm evicts the solution that is most 
similar to the newly computed offspring among those individuals of the population that have a 
smaller or equal objective than the offspring itself. Once an individual has been accepted into 
the population we further refine it using additional iterations of the ARW algorithm. The general 
structure of our evolutionary algorithm follows the steady-state approach [jU which generates only 
one offspring per generation. We give an outline in Algorithm |Tj 

3.2 Initial Solutions 

We use three different approaches to create initial solutions. Each time we create an individual for 
the population we pick one of the approaches uniformly at random. The first and most simplistic 
way is to start from an empty independent set and add nodes at random until no further nodes 
can be added. To ensure that adding a node results in a valid independent set we have to check if 
the node is free. We do this by simply checking if any of the surrounding nodes is already in the 
set. The method adds a decent amount of diversity during the construction phase, which over an 
extended period of time can lead to good solutions. 

Secondly, we use a greedy approach similar to Andrade et al. flU. Starting from an empty solu¬ 
tion, we always add the node with the least residual degree which is the number of free neighbors. 
After a node is added to the solution, we remove all its neighbouring nodes from the graph and 
update the residual degree of their neighbors. We repeat the procedure until no further node can be 
added. The implementation is done using a simple bucket priority queue which groups nodes into 
buckets based on their residual degree. This allows us to pick a random node each time multiple 
nodes share the same residual degree. 

The last approach that we use to create initial solutions is also a greedy one. Here, we take a 
detour and generate an independent set by computing a vertex cover. We first create a vertex cover 
and then compute its complement to get an independent set. The algorithms also starts with an 
empty solution and then always adds the node that will cover the most currently uncovered edges. 
We repeat this until all edges are covered and then return the corresponding independent set. Note 
that the two greedy algorithms compute different independent sets (e.g. consider a path with five 
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nodes). While the first approach always maintains an independent set and tries to improve it, the 
second approach can only return an independent set once the algorithm has terminated. 


3.3 Combine Operations 


We perform different kinds of combine operations which are all based on graph partitioning. The 
main idea of our operators is to use a partition of the graph to exchange whole blocks of solution 
nodes. In general our combination operators try to generate new independent sets that are not nec¬ 
essarily maximal. We then perform a maximization step that adds as many free nodes as possible. 
Afterwards, we apply a single iteration of the ARW local search algorithm to ensure that our solu¬ 
tion is locally optimal. Depending on the type of the operator, we use a node separator or an edge 
separator of the graph that has been computed by the graph partitioning framework KaHIP. As a 
side note, small edge or node separators are vital for our combine operations to work well. This 
is due to the fact that large separators in the combine operations yield offsprings that are far from 
being maximal. Hence, the maximization step performs lots of fixing and the computed offspring 


is not of high quality. This is supported by experiments presented in Section 4.1 


The first and the second operator need precisely two input solutions while our third operator 
is a multi-point combine operator - it can take multiple input solutions. In the first case, we use a 
simple tournament selection rule Ifl9l to determine the inputs, i.e. X x is the fittest out of two random 
individuals ri, r 2 from the population. The same is done to select X 2 . Note that due to the fact that 
our algorithms are randomized, a combine operation performed twice using the same parents can 
yield a different offspring. 


Node Separator Combination. In its simplest form, the operator starts by computing a node 
separator V = V t U V 2 U S of the input graph. We then use S' as a crossover point for our operation. 
The operator generates two offsprings. More precisely, we set 0\ = (V) n 2)) U (V 2 fl X 2 ) and 
0 2 = (V 1 nl 2 )U(V 2 nX 1 ). In other words, we exchange whole parts of independent sets from 
the blocks V\ and V 2 of the node separator. Note that the exchange can be implemented in time 
linear in the number of nodes. Recall that the definition of a node separator implies that there are 
no edges running between V\ and V 2 . Hence, the computed offsprings are independent sets, but 
may not be maximal since separator nodes have been ignored and potentially some of them can be 
added to the solution. We maximize the offsprings by using the greedy independent set algorithm 
from Section |3.2| The operator finishes with one iteration of the ARW algorithm to ensure that 


we reached a local optimum and to add some diversification. An example illustrating the combine 
operation is shown in Figure |T] 


Edge Separator Combination. This operator computes offsprings by taking a detour over vertex 
covers. It starts by computing a bipartition V — V\ U V 2 of the graph. Let C r be the vertex cover 
V\Ii. We define temporary vertex cover offsprings similar to before: D\ = (C\ fl Vi) U (C 2 fl V 2 ) 
and I)> = (Ci fl V 2 ) U (C 2 IT Vi). Unfortunately, it is possible that an offspring created this 
way contains some non-covered edges. These edges can only be a subset of the cut edges of 
the partition. We want to add as little nodes as possible to our solution to fix this. Hence, we 
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Figure 1: An example combine operation using a node separator V — V\ U V 2 U S. On top two 
input individuals/independent sets, Xi and X 2 , are shown. Bottom left: a possible offspring that 
uses the independent set of X x in block V\ and the independent set of X 2 in block V 2 . Bottom right: 
the improved offspring after ARW local search has been applied to improve the given solution and 
to add nodes from the separator to the independent set. 


add a minimum vertex cover of the bipartite graph induced by the non-covered cut edges to our 
vertex cover offspring. The minimum vertex cover in a bipartite graph can be computed using the 
Hopcroft-Karp algorithm. Afterwards, we transform the vertex cover back to an independent set, 
and follow our general approach by applying ARW local search to reach a local optimum. 

Multi-way Combination. Our last two operators are multi-point crossover operators that extend 
the previous two operators. Both of them divide the graph into a number of blocks k. Depending 
on the type of the operator, a node or edge separator is used. We start with the description of the 
node separator approach where V = V\ U ... U V k U S. The operator selects a number of parents. 
We then calculate the score for every possible pair of a parent X t and a block V 3 . The score of a 
pair is the number of the parents solution nodes inside the given block. We then select the parent 
with the highest score for each of the blocks to compute the offspring. As before, since we left 
out the separator nodes we use a maximization step to make the solution maximal and afterwards 
apply ARW local search to ensure that our solution is a local optimum. 

If we use an edge separator for the combination, we start with a k -way partition of the nodes 
V — V\ U ... U 14. This approach also computes scores for each pair of parent and block. This 
time the score of a pair is defined as the number of the vertex cover nodes of the complement of an 
independent set inside the given block. We then select the parent with the lowest score for each of 
the blocks to compute the offspring. As in the simple vertex cover combine operator, it is possible 
that some cut edges are not covered. We use the simple greedy vertex cover algorithm to fix the 
offspring since the graph induced by the non-covered cut edges is not bipartite anymore. We then 
once again complement our vertex cover to get our final offspring. 
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3.4 Mutation Operations 

After we performed a combine operation, we apply a mutation operator to introduce further diver¬ 
sification. Previous work [j3] [6J uses bit-flipping for mutation, i.e. every bit in the representation 
of a solution has a certain probability of being flipped. We can not use this approach since our 
population only allows valid solutions. Instead we perform forced insertions of new nodes into the 
solution and remove adjacent solution nodes if necessary as in the perturbation routine of the ARW 
algorithm. Afterwards we perform ARW local search to improve the perturbed solution. 

3.5 Miscellanea 

Instead of computing a new partition for every combine operation, we hold a pool of partitions 
and separators that is computed in the beginning. A combine operation then picks a random par¬ 
tition or node separator from the pool. If the combine operations have been unsuccessful for too 
many iterations, we compute a fresh set of partitions. In our experiments we used two-hundred 
unsuccessful combine operations as a threshold. Additionally, we have to ensure that the partitions 
created for the combine operations are sufficiently different over multiple runs. However, although 
KaHIP is a randomized algorithm, small cuts in a graph may be similar. To avoid similar cuts and 
increase diversification of the partitions and node separators, we additionally give KaHIP a random 
imbalance e e md [0.05, 0.75] to solve the partitioning problem. Additionally, we tried one more 
combine operator based on set intersection. This operator computes an offspring by keeping the 
nodes that are in both inputs which is by definition an independent set. However, our experiments 
with the operator did not yield good results so that we omit further investigations here. 


4 Experimental Evaluation 

Methodology. We have implemented the algorithm described above (EvoMIS) using C++ and 
compiled all algorithms using gee 4.63 with full optimization’s turned on (-03 flag). We mainly 
compare our algorithm against the ARW algorithm since it has a relatively clear advantage in 
Resende et al. m. The algorithm by Grosso et al. Ifl4l has originally been formulated for the 
maximum clique problem. Andrade et al. [Q3 used an implementation of the algorithm for the 
maximum independent set problem. Hence, we also compare against the results of the algorithm 
by Grosso et al. presented in the paper of Andrade et al. Q]|. Additionally, we compare ourselves 
with our implementation of the evolutionary algorithm presented by Back and Khuri [|3l . 

Unless otherwise mentioned, we perform five repetitions where each algorithm that we run gets 
ten hours of running time to compute a solution. Each run was made on a machine that is equipped 
with two Quad-core Intel Xeon processors (X5355) which run at a clock speed of 2.667 GHz. It 
has 2x4 MB of level 2 cache each, 64 GB main memory and runs Suse Linux Enterprise 10 SP 1. 
We used the fastsocial configuration of the KaHIP v0.6 graph partitioning package |[2Til to obtain 
graph partitions and node separators. The test results for the ARW algorithm were obtained by 
using the original algorithm from Andrade et al. 0]. Within the evolutionary algorithm we used 
our own implementation of the ARW algorithm. 
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We mostly present two kinds of data: maximum values, average values, minimum values as 
well as plots that show the evolution of solution quality. We now explain how we compute the con¬ 
vergence plots. Whenever an algorithm creates a new best independent set S it reports a tuple (t, 
151), where the time stamp t is the currently elapsed time and |5| refers to the size of the indepen¬ 
dent set that has been created. Since we perform multiple repetitions, the final plots correspond to 
average values over these repetitions. To compute these we take the time stamps of all repetitions 
and sort them in ascending order. For each time stamp in this series, we report the average value 
of the best solution size of each repetition at that time. 

Algorithm Configuration. After an extensive evaluation of the parameters lUTTl . we fixed the 
population size to two hundred fifty, the partition pool size to thirty, the number of ARW iterations 
to 15 000 as well as the number of blocks used for the multi-way combine operations to sixty-four. 
In each iteration, one of our three combine operations is picked uniformly at random. However, 
our experiments indicate that our algorithm is not too sensitive about the precise choice of the 
parameters. We mark the instances that have also been used for the parameter tuning in ifTTl in 
Appendix |A| with a *. 

Instances. We use graphs from various sources to test our algorithm. We divide them into five 
categories: social networks, meshes, road networks, networks from finite element computations as 
well as networks stemming from matrices. Social networks include citation networks, autonomous 
systems graphs or web graphs taken from the 10th DIM ACS Implementation Challenge bench¬ 
mark set 01. Road networks and meshes are taken from Andrade et al. dU and have been kindly 
provided by Renato Wemeck. Meshes are dual graphs of triangular meshes. Networks stemming 
from finite element computations have been taken from Chris Walshaw’s benchmark archive Il23l . 
Graphs stemming from matrices have been taken from the Florida Sparse Matrix Collection Q. 
We randomly selected one from each group of all real, symmetric matrices having between 10K 
and 65K columns. A graph is derived by inserting a node for each column and creating an edge 
between two nodes u, v if the corresponding matrix entry is non-zero. Self-loops are removed from 
the graphs. 

4.1 Main Results 

We now shortly summarize the main results of our experiments. First of all, in 50 out of the 67 
instances, we either improve or reproduce the maximum result computed by the ARW algorithm. 
Our algorithm computes a maximum solution that is strictly larger than the maximum solution 
computed by the ARW algorithm in 21 cases. Contrarily, in 17 cases the maximum result of the 
ARW algorithm is larger then the maximum result of our algorithm. When looking at average 
values, we get 23 cases in which our algorithm strictly outperforms the ARW algorithm, and 17 
cases for the opposite direction. Remarkably, when looking at the graphs obtained from the Florida 
Sparse Matrix collection, the average value of the ARW algorithm only outperforms our algorithm 
on one instance. The mesh family that we use in this paper has also been used in the original ARW 
paper HD- We like to stress that most of the maximum results of the ARW algorithm are strictly 
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Figure 2: Solution size evolving over time for four instances from the mesh family: feline and 
gameguy [top] as well as buddha and dragon [bottom]. 


larger than the maximum values originally reported by Andrade et al. 03 (including the maximum 
values presented there of the algorithm by Grosso et al. 031). Except for four instances the same 
holds for our algorithm. On these four instances, our algorithm is worse than the original maximum 
value of the ARW algorithm. On the mesh family, in 8 out of 14 cases our algorithm computes 
the best result ever reported in literature. On road networks and the largest graphs from the mesh 
family as well as Walshaw family the ARW algorithm outperforms our algorithm. We tried to 
give both algorithms more time, i.e. a whole day of computation, but did not see much different 
results. Lastly, there is an interesting observation on social networks, that is in 5 out of 9 cases the 
minimum, average and maximum result produced by both algorithms are precisely the same. We 
suspect that these instances are in a sense easy and that both algorithms compute the optimal result 
or are very close to the optimum. We provide detailed per instance results in Appendix [A} 

Figure [2] shows how solution quality evolves over time on four example instances from the mesh 
family for both algorithms. As one would suspect, our algorithm almost keeps its level of solution 
quality in the beginning since it has to build the full population before it can start with combine 
and mutation operations. Contrarily, the ARW algorithm can directly start with local search and 
improve its solution. Hence, the solution quality of the ARW algorithm rises above the solution 
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quality of our algorithm. As soon as our algorithm finished to compute the population, solution 
quality starts to improve and eventually the size of the computed independent sets becomes better 
than the solution quality of the ARW algorithm. 

We also implemented the algorithm presented by Back and Kliuri 0. The algorithm uses a 
two-point crossover as a combine operation, as well as a bit-flip approach for mutation. Solutions 
created by the combine and mutation operations can be invalid. Hence, a penalty approach is used 
to deal with invalid solution candidates. In the original paper, the algorithm is only tested on small 
synthetic or random instances (< 200 nodes). We tested the algorithm on the four smallest graphs 
from the mesh family and gave the algorithm ten hours of time to compute a solution. However, 
the best valid solution created during the course of the algorithm never exceeded the size of the 
best solution after the initial population has been created. This is due to the fact that the two-point 
crossover and the mutation operations found valid solutions very rarely so that the average solution 
quality of the population degrades over time. On average, final solution quality of the algorithm 
has been more than 20% worse than the final result of our algorithm. Due to the bad solution 
quality observed, we did not perform additional experiments with this algorithm. 

The Role of Graph Partitioning. To estimate the influence of good partitionings in this context, 
we performed an experiment in which partitions of the graph have been obtained by simple breadth 
first searches. More precisely, we obtain a two-way partition of the graph using a breadth first 
search starting from a random node. Every node touched by the breadth first search is added to the 
first block, and every node not touched by the breadth first search is added to the second block. 
The breadth first search is stopped as soon as a specified number of nodes has been touched. In our 
experiments, using this approach instead of the approach that uses a graph partitioner to compute 
a partition yields significantly worse results. The influence of all the different combine operators 
that we use here is presented in the thesis [H71 . 


5 Conclusion 

We presented a very natural evolutionary framework for the computation of large maximal in¬ 
dependent sets. Our core innovations are combine operations that are based on graph partitioning 
and local search algorithms. More precisely, our combine operations enable us to quickly exchange 
whole blocks of given individuals. In contrast to previous evolutionary algorithms for the problem, 
our operators are able to guarantee that the created offspring is valid. Experiments indicate that our 
algorithms outperforms state-of-the-art algorithms on large variety of instances - some of which 
are better than every reported in literature. Important future work includes a coarse-grained par¬ 
allelization of our approach which can be done by using an island-based approach. Moreover, it 
would be interesting to improve the solution quality of our approach on road networks and to com¬ 
pare our algorithms with exact approaches. Additionally, it would be interesting to overcome the 
slow start of our algorithm due to the initialization of the population. For example, one could try 
to adjust the size of the population dynamically. 
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A Detailed per Instance Results 


Graph 

EvoMIS 

ARW 

Name 

n 

Avg. 

Max. 

Min. 

Avg. 

Max. 

Min. 

enron 

69 244 

62 811 

62811 

62811 

62811 

62811 

62 811 

gowalla 

196 591 

112 369 

112369 

112 369 

112 369 

112369 

112 369 

citation 

268 495 

150380 

150380 

150 380 

150380 

150380 

150 380 

cnr-2000* 

325 557 

229981 

229991 

229 976 

229955 

229966 

229 940 

google 

356 648 

174072 

174072 

174 072 

174072 

174072 

174 072 

coPapers 

434 102 

47 996 

47996 

47 996 

47 996 

47996 

47 996 

skitter* 

554930 

328519 

328520 

328 519 

328 609 

328619 

328 599 

amazon 

735 323 

309774 

309778 

309 769 

309792 

309793 

309 791 

in-2004* 

1382908 

896581 

896585 

896 580 

896477 

896 562 

896 408 


Table 1: Results for social networks. 
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Graph 

EvoMIS 

ARW 

Name 

n 

Avg. 

Max. 

Min. 

Avg. 

Max. 

Min. 

beethoven 

4419 

2 004 

2004 

2004 

2 004 

2004 

2004 

cow 

5 036 

2 346 

2346 

2 346 

2 346 

2346 

2 346 

venus 

5 672 

2 684 

2684 

2 684 

2 684 

2 684 

2684 

fandisk 

8 634 

4 075 

4075 

4075 

4 073 

4074 

4072 

blob 

16068 

7 249 

7250 

7 248 

7 249 

7250 

7 249 

gargoyle 

20000 

8 853 

8854 

8 852 

8 852 

8 853 

8 852 

face 

22 871 

10218 

10218 

10218 

10217 

10217 

10217 

feline 

41262 

18 853 

18854 

18851 

18 847 

18 848 

18 846 

gameguy 

42 623 

20726 

20727 

20726 

20 670 

20690 

20659 

bunny* 

68 790 

32 337 

32343 

32 330 

32 293 

32 300 

32 287 

dragon 

150000 

66 373 

66 383 

66 365 

66 503 

66505 

66 500 

turtle 

267 534 

122 378 

122 391 

122 370 

122 506 

122584 

122444 

dragonsub 

600000 

281403 

281436 

281384 

282 006 

282066 

281954 

ecat 

684496 

322 285 

322 357 

322222 

322 362 

322529 

322269 

buddha 1087 716 

478 879 

478 936 

478 795 

480942 

480969 

480921 


Table 2: Results for mesh type graphs. 


Graph 

EvoMIS 

ARW 

Name 

n 

Avg. 

Max. 

Min. 

Avg. 

Max. 

Min. 

crack 

10 240 

4 603 

4603 

4 603 

4 603 

4603 

4 603 

vibrobox 

12 328 

1852 

1852 

1852 

1850 

1851 

1849 

4elt 

15 606 

4944 

4944 

4 944 

4942 

4944 

4940 

cs4 

22 499 

9172 

9177 

9170 

9173 

9174 

9172 

bcsstk30 

28 924 

1783 

1783 

1783 

1783 

1783 

1783 

bcsstk31 

35 588 

3 488 

3488 

3 488 

3 487 

3 487 

3 487 

fe_pwt 

36519 

9 309 

9310 

9 309 

9310 

9310 

9 308 

brack2 

62 631 

21417 

21417 

21417 

21416 

21416 

21415 

fe_tooth 

78 136 

27 793 

27793 

27 793 

27 792 

27 792 

27 791 

fe_rotor 

99 617 

22022 

22026 

22019 

21974 

22030 

21902 

598a 

110971 

21826 

21829 

21824 

21891 

21894 

21888 

wave 

156317 

37 057 

37063 

37 046 

37 023 

37 040 

36999 

fe_ocean 

143 437 

71390 

71576 

71233 

71492 

71655 

71291 

auto 

448 695 

83 935 

83 969 

83 907 

84462 

84478 

84453 


Table 3: Results for Walshaw benchmark graphs. 


Graph 

EvoMIS 

ARW 

Name 

n 

Avg. 

Max. 

Min. 

Avg. 

Max. 

Min. 

ny 

264 346 

131384 

131395 

131377 

131481 

131485 

131476 

bay 

321270 

166 329 

166 345 

166318 

166 368 

166 375 

166 364 

col 

435 666 

225 714 

225 721 

225 706 

225 764 

225 768 

225 759 

f la 

1070 376 

549093 

549 106 

549 072 

549581 

549 587 

549 574 


Table 4: Results for road networks. 
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Graph 

EvoMIS 

ARW 

Name 

n 

Avg. 

Max. 

Min. 

Avg. 

Max. 

Min. 

Oregon-1 

11 174 

9512 

9512 

9512 

9512 

9 512 

9512 

ca-HepPh 

12006 

4994 

4994 

4 994 

4 994 

4 994 

4 994 

skirt 

12 595 

2 383 

2383 

2 383 

2 383 

2 383 

2 383 

cbuckle 

13681 

1097 

1097 

1097 

1097 

1097 

1097 

cyl6 

13681 

600 

600 

600 

600 

600 

600 

case9 

14453 

7 224 

7224 

7 224 

7 224 

7 224 

7 224 

rajat07 

14 842 

4971 

4971 

4971 

4 971 

4 971 

4 971 

Dubcoval 

16129 

4096 

4096 

4096 

4 096 

4 096 

4 096 

olaf u 

16146 

735 

735 

735 

735 

735 

735 

bodyy6 

19 366 

6232 

6233 

6230 

6 226 

6 228 

6 224 

raefsky4 

19779 

1055 

1055 

1055 

1053 

1053 

1053 

smt 

25710 

782 

782 

782 

780 

780 

780 

pdblHYS 

36417 

1078 

1078 

1078 

1070 

1071 

1070 

c-57 

37 833 

19997 

19997 

19997 

19 997 

19 997 

19 997 

copter2 

55 476 

15 192 

15195 

15 191 

15 186 

15 194 

15 179 

TSOPF_FS_b30 0_c2 

56813 

28 338 

28338 

28 338 

28 338 

28 338 

28 338 

c-67 

57 975 

31257 

31257 

31257 

31257 

31257 

31257 

dixmaanl 

60000 

20000 

20000 

20000 

20000 

20000 

20000 

blockqpl 

60012 

20011 

20011 

20011 

20 011 

20011 

20011 

Ga3As3H12 

61349 

8118 

8151 

8 097 

8 061 

8 124 

7 842 

GaAsH6 

61349 

8 562 

8 572 

8 547 

8519 

8 575 

8351 

cant 

62208 

6260 

6260 

6 260 

6 255 

6 255 

6 254 

ncvxqp5 

62 500 

24 526 

24 537 

24 510 

24 580 

24 608 

24 520 

crankseg_2 

63 838 

1735 

1735 

1735 

1735 

1735 

1735 

c-68 

64 810 

36 546 

36546 

36 546 

36 546 

36 546 

36 546 


Table 5: Results for graphs from Florida Sparse Matrix collection. 
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